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Abstract 

We consider a branching-selection system in R with TV particles which give 
birth independently at rate 1 and where after each birth the leftmost particle is 
erased, keeping the number of particles constant. We show that, as N — > oo, the 
empirical measure process associated to the system converges in distribution to a 
deterministic measure- valued process whose densities solve a free boundary integro- 
differential equation. We also show that this equation has a unique traveling wave 
solution traveling at speed c or no such solution depending on whether c > a or 
c < a, where a is the asymptotic speed of the branching random walk obtained by 
ignoring the removal of the leftmost particles in our process. The traveling wave 
solutions correspond to solutions of Wiener- Hopf equations. 

1 Introduction and statement of the results 

We will consider the following branching-selection particle system. At any time t we 
have N particles on the real line with positions r/^ (1) > ■ ■ ■ > rj^(N). Each one of the N 
particles gives birth at rate 1 to a new particle whose position is chosen, relative to the 
parent particle, using a given probability distribution p on R. Whenever a new particle 
is born, we reorder the N + 1 particles and erase the leftmost one (so the number of 
particles is always kept equal to N). We will denote by Xat = {(£(1), • • • ,£(N)) G R^ : 
£(1) > ■ ■ ■ > the state space of our process. 

We learned of this process through the work of Durrett and Mayberry (2008), who 
considered the special case in which p corresponds to a uniform random variable on 
[—1,1]. However, our process is a member of a family of processes that first arose in 
work of Brunet and Derrida (1997), who studied discrete approximations of the Fisher- 
Kolmogorov PDE: 

dh d 2 h , ,„ 

ai = w +h - h - 

In their discretization they have iV particles at each integer time k with integer positions 
xi(k), . . . , xjy(k). At each time k they choose, for each i G {1, . . . , k}, two particles ji 
and j[ at random (and with replacement) from {1, . . . , N} and set 

Xi(k + 1) = max{xjv(£;) + a^x^ik) + a-}, 
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where c^, a' { , 1 < i < N, are independent = 1 with probability p and = with probability 
1—p (and are resampled at each time step) . 

They conjectured that the propagation speed of the finite system converged to the 
minimal wave speed of the PDE at a slow rate, of order (log N)~ 2 . Berard and Gouere 
(2008) considered a discrete time version of the process where at each time step each of 
the N particles are replaced by two new particles, and then only the N rightmost ones 
are kept. Under some assumptions on the distribution used to choose the locations of 
the new particles (a strong one being that new particles are always sent to the right of 
the parent particle) they proved that the system moves to the right with a deterministic 
asymptotic speed vjy which converges as N — > oo to some explicit v at the predicted 
rate. 

Although we will say something about the behavior of the system for fixed N, our 
main interest in this paper is to study the behavior of the empirical distribution of the 
process as iV — > oo. Before proceeding with this, let us specify some assumptions. When 
a particle at x gives birth, the new particle is sent to a location x + y with y being chosen 
from an absolutely continuous probability distribution p(y)dy. We will assume that p 
is symmetric and that \x\p(x) dx < oo. The initial condition for our process will 
always be specified as follows: each particle starts at a location chosen independently 
from a probability measure fo(x)dx, where fo(x) = for x < and fo(x) is strictly 
positive and continuous for x > 0. 



1.1 Convergence to the solution of a free boundary problem 

Let 

1 N 

i=i 

be the empirical measure associated to 77^. Observe that the initial empirical measure 
Vq(dx) converges in distribution to fo(x)dx. We will show that, as iV — > 00, this 
empirical measure process converges to a deterministic measure-valued process whose 
densities are the solution of a certain free boundary problem. 

Alternatively one could think of the following (weaker) version of the problem. It 
is not hard to see that the probability measure E(f t Ar (-)) on R is absolutely continuous. 
Letting / (t,x) be its density, then we want to study its limit as N — > 00. We expect 
this limit f(t,x) to correspond to the densities of the limiting measure- valued process 
mentioned above. Now observe that if is a version of our process in which we 
do not erase the leftmost particle after births (i.e., £^ is a branching random walk), 
then we would expect that the density of the corresponding expected empirical measure 
converges to the solution f(t, x) of the following integro-differential equation: 

d? f°° - 

(i-i) = J f( t >y)p( x ~y) d y- 

This is indeed the case, as we will see in Proposition 2.1. Observe that the total mass 
of f(t, •) grows exponentially (in fact f(t, y) dy = e t ). 

By adding the selection step to our process we ensure that the limiting density always 
has mass 1, but otherwise the branching mechanism is still governed by the convolution 
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term appearing in (1.1). Thus we expect that if the limit f(t,x) = lirn/v->oo f N (t, x) 
exists, then it has to satisfy the following: there exists a continuous non-decreasing 
function 7: [0, 00) — ► K with 7(0) = such that (f(t,x),j(t)) is the unique solution 
to the following free boundary problem (FB): 

(FBI) %(t,x)= f°° f(t,y)p(x-y)dy Vx> 7 (i) 



CO 



dt 

(FB2) / f(t,y)dy = l 

hit) 

(FB3) f(t,x) = Vx<j{t) 

with initial condition f(0,x) = fo(x) for all x £ R. j(t) is a moving boundary which 
keeps the mass of f(t, •) at 1, but the speed at which it moves is not known in advance 
and depends in turn on /. 

It is not a priori obvious that (FB) has a solution, let alone that such a solution is 
unique. We will prove the existence and uniqueness using arguments closely related to 
the ones we will use to prove the existence of the limiting density. 

We will denote by V the space of probability measures on K, which we endow with 
the topology of weak convergence, and by D([0,T],V) the space of cadlag functions 
from [0, T] to V endowed with the Skorohod topology. 

Theorem 1. For any fixed T > the sequence of V -valued processes on [0, T] 
converges in distribution in D([0,T],V) to a deterministic vt £ D([0, T], V). vt is ab- 
solutely continuous with respect to the Lebesgue measure for every t £ [0, T] and the 
corresponding densities f(t,-) are characterized by the following: there exists a contin- 
uous non- decreasing function 7 : [0, 00) — ► M. with 7(0) = such that (f(t, x), 7(i)) 
is the unique solution of the free boundary problem (FB). In particular for x > "f(t), 
f(t, x) is strictly positive, jointly continuous in t and x and differentiable in t. 

Let us remark that there are at least two other examples in the literature of particle 
systems converging to the solution of a free boundary equation, but in both cases the 
limiting equation is of a different type. Chayes and Swindle (1996) study a particle 
system corresponding to a liquid-solid system with an interface subject to melting and 
freezing, while Gravner and Quastel (2000) study an internal diffusion limited aggrega- 
tion model. In both cases an hydrodynamic limit is proved with the limiting equation 
corresponding to the famous Stefan problem, which involves free boundary problems 
for the heat equation where the moving boundary separates the solid and liquid phases 
(see Meirmanov (1992) and references therein for more on this problem). 

1.2 Behavior of the finite system 

To study the finite system it will be useful to introduce the shifted process A$ , which 
we define as follows: 

Af = (Af(l),...,Af(iV)) with Af(i) = rfV) - r?f(iV). 

Observe that A^(N) is always 0. It is clear that A^ is also a Markov process, and its 
transitions are the same as those of rj^ except that after erasing the leftmost particle 
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the N remaining particles are shifted to the left so that the new leftmost one lies at the 
origin. 

We will denote by minre^ = rj^(N) and max 77^ = rj^(l) the locations of the leftmost 
and rightmost particles in rff . 

Theorem 2. For every fixed N > the following hold: 

(a) There is an aj\r > such that 

N N 
mm m , max re 
hm — = hm — = aN 

t—*oo t t— >oo t 

almost surely and in L 1 . Moreover, the sequence (a^Nyo is non- decreasing. 

(b) The process Af has a unique stationary distribution /ijy ; which is absolutely con- 
tinuous. 

(c) For any (random or deterministic) initial condition Vq we have 

||F-(Af e ■) - m (-)\\ TV ^ 0. 

From this point on we will assume that the displacement distribution p has expo- 
nential decay. To be precise, we assume that there is an a > such that 

(1.2) p(x)<Ce~ a ^. 

for some C > 0. We will write 



= sup \a> 0: sup [e a l x lp(a:)] < co \ 



That is, G G (0, 00] is the maximal exponential rate of decay of p in the sense that 
p(x) < Ce~ ax for some C > when a < Q but not when a > 0. may be 00 (as 
in the cases where p has compact support or p corresponds to a normal distribution), 
while > is ensured by (1.2). 

Now let 4>{9) = e 6x p(x) dx be the moment generating function of the displace- 
ment distribution p. (1.2) and our definition of imply that (f)(0) < 00 for £ (—0, 0). 
To avoid unnecesary technical complications will make the following extra assumption, 
which in particular implies that (f)(0) = 00 for \0\ > 0: 

This assumption always holds when = 00: choosing < l\ < I2 so that p(x) > M for 
some M > and all x G [l±, I2] we get 

- (°° e d *p(x) dx > WhpU jh v ^ 

-00 9^oo 



Our next result will relate the asymptotic propagation speed ajy of our process r\ t 



N 



with the asympotic speed of the rightmost particle in the branching random walk ^ . 
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Theorem 3. 

lim oat = a, 

AT— >oo 

where a is the asymptotic speed of the rightmost particle in ^ , i.e., in a branching 
random walk where particles branch at rate 1 and their offspring are displaced by an 
amount chosen according to p. 

Remark. If p is uniform on [—1,1] this follows from (8) in Durrett and Mayberry 
(2008). However, the couplings on which the proof are based extend easily to our more 
general setting, so we do not give the details of the proof. 

The speed a has an explicit expression (see Biggins (1977)): by standard results of 
the theory of large deviations, if St is a continuous time random walk jumping at rate 
1 and with jump distribution p, then the limit 

(1.4) A(x) = lim -logP(Sf > xt) 

t— >oo t 

exists and equals — (sup e>0 {a;^ — 4>(0)} + 1); a is given then by the formula A(a) = — 1 
(see Durrett and Mayberry (2008) for more on this). 



1.3 Traveling wave solutions 

A traveling wave solution of (FB) is a solution of the form f(t,x) = w(x — ct) and 
7(t) = ct for some c > (with initial condition fo(x) = w(x)). 
If w is a traveling wave solution, then from (FBI) we get 

roo 

cw'(z) = — w(y)p(z — y) dy V z > 0, 



o 



so integrating from z = x to oo we deduce that w must solve 



oo 



'1.5a) w(x) = - I 



o 



'1.5b) / w(x) dx = 1, 



oo 







1.5c) w(x)=0 \/x<0. 

1.5d) w(x) > Vx>0, 



R(x) = / p(y) dy 



where 



is the tail distribution of p. On the other hand, it is easy to check that if w satisfies 
(1.5) and fo(x) = w(x), then (w(x — ct),ct) is the solution of (FB). 

(1.5a) is known as a Wiener-Hopf equation. Equations of this type have been stud- 
ied since at least the 1920's (at the time in relation with the theory of radiative energy 
transfer), and have since been extensively studied and found relevance in diverse prob- 
lems in mathematical physics and probability. In general, these equations can be solved 
using the Wiener-Hopf method, which was introduced in Wiener and Hopf (1931) (see 
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Chapter 4 of Paley and Wiener (1987) and also Krein (1962)). But the solutions pro- 
vided by this method are not necessarily positive, so they are not useful in our setting. 
Instead, we will rely on the results of Spitzer (1957), who studied these equations via 
probabilistic methods in the case where R(x)/c is a probability kernel. 

To do that we need to convert our equation to one where the kernel with respect to 
which we integrate is a probability kernel. To that end, we need to make the following 
observation. In Lemma 4.1 we will show that there is a A* G (0, 0) such that 

(1.6) — - = mm — - — = a, 

v ; A* AG(0,e) A 

where a is the asymptotic speed introduced in Theorem 3. On the other hand, the 
function A i— > <f>(\)/\ is continuous and goes to oo as A — > (and moreover, as we 
will show in Lemma 4.1, it is decreasing on (0, A*)). Thus for every c > a there is a 
A E (0, A*] such that 4>{X)/\ = c. 

Observe that the tail distribution R of p has the same decay as p (see (1.2)). That 
is, for every < a < O there is a C > so that 

(1.7) R(x)<Ce~ ax Vx>0. 

Fix c > a and use the above observation to pick a A 6 (0, A*] such that ^>(A)/A = c. 
(1.7) implies that the function x \— ► e Xx R(x) is in L 1 (M). Moreover, integration by parts 
yields 



/oo 
■ 
-oo 



e Ax R{x) dx 



A 

Therefore 

is a probability kernel. On the other hand, if w is a solution of (1.5) with c = <j){\)/\ 
then it is easy to check that u{x) = e Xx w(x) satisfies 

/"OO 

(1.8) u(x) = / u(y)k(x — y) dy Vx > 0. 



JO 

Thus the idea will be to recover solutions of (1.5) from positive solutions of (1.8). 

Positive solutions of (1.8) can be regarded as densities of stationary measures for the 
following Markov chain. Let £ n be a sequence of i.i.d. random variables with distribution 
given by k, let Xq = and define 

X n +\ = (X n + £ n ) + , 

where x + = max{0, x}. This chain appears, for example, in the study of ladder variables 
for a random walk (see Chapter XII of Feller (1971)) and in the study of the GI/G/1 
queue (see Chapter 5 of Durrett (2004)). If u satisfies (1.8), it is on the negative 
half-line and it is non-negative on the positive half-line, then the measure (supported 
on [0, oo)) having u as its density is invariant for X n . Assuming that E(|£i|) < oo, X n 
is recurrent, null-recurrent or transient according to whether E(£i) is negative, zero or 
positive. As we will see in Section 4, this expectation is negative in our case if and only 



6 



if A < A* and it is zero for A = A*. In both cases the theory of recurrent Harris chains 
suggests (and Theorem 4.2 will prove) that there exists a unique (up to multiplicative 
constant) invariant measure for X n , although this measure may not be finite in the null- 
recurrent case. The difference between the recurrent and null-recurrent cases explains 
the difference between the cases c > a and c = a in Theorem 4 below. The fact that 
the chain is transient when A > A* suggests that there are no positive solutions of (1.8) 
for these values of A (again see Theorem 4.2 for a proof). This in turn hints at the 
possibility that there are no solutions of (1.5) for c < a. Our proof of this fact will not 
rely in seeing (1.5) as a Wiener-Hopf equation, but instead will use explicitly the fact 
that its solutions are traveling wave solutions of (FB). 

Theorem 4. Assume that (1.2) and (1.3) hold. 

(a) If c > a the equation (1.5) has a unique solution w. This solution is differentiate 
except at the origin. 

(b) When c > a, and letting A S (0, A*) be such that 4>(\)/\ = c, the solution satisfies 
J °° e Xx w(x) dx < oo (which, in particular, implies that w(y) dy = 0(e~ Xx )). 

Moreover, if A > A then sup x e Xx J°° w{y) dy = oo. 

(c) When c = a the solution satisfies f£ e x * y w(y) dy = 0(x). 

(d) If c < a the equation (1.5) has no solution. 

We remark that, when c > a, the solution w given by the theorem can be obtained 
by the following limiting procedure. Take < A < A* as in the above statement and let 
uo be the density of any non-negative random variable whose distribution is absolutely 
continuous. Now let wq(x) = e~ Xx uo(x) and then for n > 1 let 

1 f°° 

w n+ i(x) = - w n (y)R(x - y) dy for x > 0. 
c Jo 

Then the limit w 00 (x) = lim n _> 00 w n (x) exists and defines an integrable continuous 
function. The solution w is then given by w(x) = Kw OCJ (x) with K > chosen so that 
w integrates to 1. The fact that w has this representation follows from the results of 
Spitzer (see Theorem 4.2). 

The rest of the paper is devoted to proofs, with one section devoted to each one of 
the proofs of Theorems 1, 2 and 4. 

2 Proof of Theorem 1 

Most of the work in the proof of Theorem 1 will correspond to showing that for each 
fixed t > the tail distribution of our process at that time, F N (t,x) = u]r([x,oo)), 
converges (almost surely and in L 1 ) to a deterministic limit F(t,x) corresponding to 
the tail distribution of a random variable and that, moreover, the limit F(t, x) has a 
density f(t,x) (that is F(t,x) = f(t,y) dy) which solves (FB). 

To get started we need to introduce some auxiliary processes. Without loss of 
generality we may work during all this section with r)t defined on the time interval 
[0, T] for some given T, and for simplicity we will actually assume that T = 1. 

First, recall that ^ is the same process as 77^ but with no killing and let be the 
associated empirical measure. Let M be the space of finite measures on M, endowed 
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with the topology of weak convergence, and let C([0, 1],7W) and D([0, 1],M) be the 
spaces of continuous and cadlag functions from [0, 1] to A4 endowed respectively with 
the uniform and Skorohod topologies. 



Proposition 2.1. Fix T > 0. Then the process t>t converges in distribution in 
D([0,1],V) to a deterministic % in C([0, 1],"P) which for each t E [0,1] is absolutely 
continuous with respect to the Lebesgue measure. If we denote the density ofv t by f(t,x) 
then f(t,x) is the unique solution to the integro- differential equation 

(2.1) ^(t,x) = J°° f(t,y)p(x-y)dy 

on [0, 1] with initial condition /(0, x) = fo(x). 

Proof. By Theorem 5.3 of Fournier and Meleard (2004) we have that Dj^ converges 
in distribution to a deterministic Dt in C([0, which is the unique solution of the 
following system: for all bounded and measurable (p, 

/oo roo ft roo poo 

(p(x)9 t (dx)= ip(x)V (dx)+ / / ip(y)p(x - y)dyv s (dx)ds. 
-oo J— oo Jo J— ooJ— oo 

Moreover, by Proposition 5.4 of the same paper, u t is absolutely continuous for all 
t G [0, 1], and hence its density /(t, x) must satisfy jf(0, x) = fo(x) and 

-17 <p(x)f(t,x)dx= / tp(y)p(x - y)f(t,x)dydx. 

®" J —oo J— ooJ — oo 

Taking ip = l[ ZyZ+ h]i dividing by h, taking h — > and using the symmetry of p we deduce 
that / satisfies (FBI) at z and the result follows. □ 

Recalling that fo(x) > for x > it is clear that if / solves (2.1) then f(t,x) > 
for all t G [0, 1] and x e R. 

For each k > we define the modified process r]f' k inductively as follows. For each 
m = 0, . . . , 2 k — 1, run the process with no killing on the interval [p-, ^^-) and then at 

time repeatedly delete the leftmost particle until there are only iV left. Let u^' k be 
the empirical measure associated with this process (we consider here the cadlag version 
of this process, and we do the same for other processes and functions defined below). 
To make clear the distinction between the particles we erase in and those we erase in 
this modified process at dyadic times, we will refer to this last procedure as shaving off 

the extra mass in r] t ,k . Our goal now is to study the convergence of the tail distribution 

c N ,k 
of v t . 

Let us first describe what will turn out to be the limit. It is defined inductively on 
each of the dyadic subintervals of [0, 1]. We let f k (0, x) = fo(x). If we have constructed 
f k up to time ^ for some m G {0, . . . , 2 k - 1} then for t G 3^) we let f k (t, x) be 
the solution of 

a fk roc 

(2.3) ~jr^ x)= / f k ^y)p^-y) d y- 
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Then at time we let X^ +1 be such that 



A m+1 



f h {(^)-,y)dy 



and define 



/ fc (^,*) = / fc (m-^)w 



m+1 



We will denote by F N,k the tail distribution of ' , that is, 

F N >»(t,x)=v?> k ([x,oo)), 
and by F k be the tail distribution of v k . 

Remark. To avoid confusion (and notational complications) we will use the following 
convention: upper-case superscripts, such as in F N (t,x), refer to quantities associated 
with our stochastic process rjf, while lower-case superscripts, such as in F k (t,x) refer 
to the deterministic quantities we just introduced. 

Proposition 2.2. For every fixed k > 1 and every t G [0,1] and x G R, F N ' (t,x) 
converges in probability to F k (t,x). 

Proof. Proposition 2.1 implies that i/^' ([x, oo)) converges in probability to v k ([x,oo)) 
for all t E [0, p-) and all Using the partial order in .M given by fi ^ z/ if and only 

if oo)) < u([x, oo)) for all x G R, it is clear that the mappings 1 1— > v\ and 1 1— > z/^' 
are increasing on [0, and thus the limits lim^^fe v k = u k and lim^x^fc ' = v_' 
exist (for z^'^ these statements hold almost surely). Using the inequalities 

(z^-z^)(R) < e'-e s and 

e((^*-0(R)|^*) 



(which follow from the comment after (1.1)) and the fact that v_ is absolutely contin- 
uous, it is easy to see that v^ ,k ([x, oo)) converges in probability to z^.([:c, oo)) for all 
x. Since x \— > v^((x, oo)) is strictly decreasing at x = X k , the location of the point at 
which the mass of u t ' k is shaved off at time converges in probability to X k . The 
result for t = ^ follows from this, and induction gives the desired result. □ 

Proposition 2.3. For every given t G [0,1] and F N,k (t,x) converges in proba- 

bility to F N (t, x) as k — > oo, uniformly in N. 

The proof depends on the following lemma: 

Lemma 2.4. We can couple nf' k and rjj^ (starting with the same initial configuration) 
in such a way that the following holds: for every N > 1, k > 1 and m G {0, . . . , 2 k }, 

E(l^ /2fe A^ 2fe |)<2iV-^I_2--, 

where AAB = A\B U B\A. 
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Proof. The coupling will be constructed inductively on each dyadic subinterval of [0, 1]. 
We start both processes with the same initial configuration. The idea will be to use 
the same branching times and displacements whenever possible. To do this we will 
decompose r)^' k in the following way 

fn a\ N.k nN,k . . n 7V,fc , . r>N,k 

(2.4) rj t ' = G t > U D t U B t , 

where the unions are disjoint, G t ' k C rjf will correspond to "good particles" (i.e. 
coupled particles), B^' will correspond to "bad particles" (i.e. uncoupled particles) 
and D^' k will correspond to "dangerous particles" (i.e. particles which might become 
bad if not erased). 

To get started we take G$' k = t}q and D^' k = B^ ,k = 0. Let < t\ < T2 < ■ ■ ■ < 1 
be the sequence of branching times for particles in r]^ (there are almost surely finitely 
many such times) and let I m € N be such that rj m is the last branching before time 
For each i < I\ we check whether the branching at time Tj < occurs at a 

particle which is in G^ ,k . If that is the case, we add the new particle to G^' fc and then 
we look for the particle erased from r]^. after this branching and move it from G^' k 
to D t ' . By our decomposition this results in rj t ' having a branching at each time 
Ti and using the same displacements as in rj^ to choose the location of the newborn 
particles. The difference is that in r]^ the leftmost particle is erased, while in rj t ' k the 
leftmost particle is moved to the set of dangerous particles. At the same time, we let 
each particle in Df' k U B^' k branch at rate 1 and use p to choose the displacements 
for their offspring. Whenever there is such a branching we put the newborn particle 
in B^ ,k . In this manner, every particle that is in rf t ' C\r)^ undergoes branchings at 
the same times and uses the same displacements for the newborn particles, while the 
particles in r]^ ,k \r]^ branch independently and all their offspring is labelled as bad. At 
time 7^- we need to shave off the extra mass in rj t ' . Observe that, by our construction, 
all dangerous particles are erased (though we will, in general, also erase good and bad 
particles). 

We continue the coupling inductively following the same procedure. For 1 < m < 
2 k — 1 we will have a decomposition n \ nh = G ' /ni . U D U B 'iL (in fact, D 

1 'm/2 K m/2 K m/2 K m/2 K v ' m/2 K 

will be empty, as we already mentioned for the case m = 1, and by the discussion below 
for m > 1). We consider the branching times Tj for I m < i < I m +i and use these for 
the shared branchings (i.e., for those particles in G t ' ) on the time interval (S, ^l* )■ 
Whenever the erased particle in rj^ is also in G^' k , we move it from that set to D^' k . We 
keep branching the remaining particles (those in r/^ Ar]^ ,k ) independently as described 
above. At time we need to shave off the extra mass in r] t ' . As before, we may 
erase particles of each of the three types, but it is no longer necessarily the case that all 
dangerous particles will be erased. In any case, the total number of dangerous and bad 
particles remaining must be at most the number of bad particles before shaving. To see 
this, observe that each dangerous particle corresponds to a branching which increased 
by one the number of particles in rjf' k , so to each dangerous particle there corresponds 
some particle which will be erased when shaving; the corresponding particle to be erased 
is possibly the dangerous particle itself, and otherwise it has to be a bad particle because 
all good particles are to the right of any dangerous particle. After shaving off the mass 
we relabel all the remaining dangerous particles as bad. 
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An important consequence of the preceding is that the sequence (l-^ m / 2 fc|) m=0 2 k 
is non-decreasing (because the total number of particles at these times is always ./V and 
the number of dangerous particles at these times is 0, while the number of good particles 
cannot grow). 



On each dyadic subinterval the pair 



(df,b+) which evolves as follows: 



D^' k \,\B" ,K \) is dominated by a process 



4 



4 + 1 
bi + i 



at rate N 
at rate d\ + b k 



(the second rate is the correct one, but the first one is not necessarily so, because the 
number of good particles is in general less than or equal to N). Using the fact that d\ is 
always at the beginning of each dyadic subinterval, this implies that E(b k ) must solve 
dE(b$)/dt = Nt + E(b k ) on these intervals. The solution of the ODE y'(t) = Nt + y(t) 
is y(t) = (y(0) + N)e t - N(l + t), and thus for § < t < we obtain 



(2.5) 



E$) = (E(b k m/2k )+N)e t 



2 k 

m 
-717 



N(t 



Since 6q = we deduce that E(i£ 



(1/2*)- 



N(e 2 



1) < N2' 2k . At time X 



we need to shave off the extra mass in J7^ 2 *\_ and this leaves us with d k , 2k = and 



o 1/2fc S o,. 



(1/2*)- 



Repeating this argument we get 



E(b k /2k )<E(b k 2/2k) _) = N[(l + 2 
and inductively we deduce that 



-2k )e 2~ 



(2" fe + l)] < N(l + e 2 k )2~ 2k 



(2.6) 



E{b\) < N 



E 

j=0 



J2- 



-2A- 



N- 



Since by our construction there are no dangerous particles at time 1 we have \ rj^ Ar]^ 



JV.fci 



2\B^' k \ and the result follows for m 
the same holds for m < 2 k . 



m/2 k 



I) 



m=o,...,2* is non-decreasing, 

□ 



Since [\B_, 

Proof of Proposition 2.3. Fix k > for a moment and assume that t £ [S, 2k /■ 
Using the coupling introduced in the proof Lemma 2.4 we have that E^I^^fcl) < 



" m m+1 ' 



N2~ 2k (e- l)/(e z 



1) and thus (2.5) implies that 



E 



JV,fe 



2E 



2E 



AT,fe 



< 2N(t-§) +2N 



N2 -k+i + N 



1 + 



-2A- 



a t-m/2 k 



(t-S)-l 



+ 1 



-2fc+l 



Observing that 

F N > k (t,x)-F N (t,x) 



1 



AT 
i=l 



A' 



1 \ - 

< — > 1 N, k 



8=1 
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we deduce by Markov's inequality and the above estimate that 



F N ' k (t,x)-F iy (t,x) 



2 -fc+i + 



1 



+ 12 



-2fe+l 



k— >oo 



uniformly in iV. 
Lemma 2.5. 



□ 



F k (t,x) > F k+1 (t,x) for allte [0,1], x € R, k > 1. 

Proof. Fix A; > 1 and The result is trivial at t = 0. We will work inductively on 

the intervals 2Z |lr]. Take < m < 2 k - 1 and assume that 

F fc (f,x)>F fc+1 (f,x). 

Then writing H = F k — F k+l we have for t G (§, ^) that 

<9# 



dt 



\t,x) 



oo />oo 



(2.7) 



oo 

OO POO 



-OO J X 

oo 



/*(t jU )-/ JW - 1 (t, U )Jp(y--u) dndy 

y - v) - f k+1 (t, y - vj\ p(v) dy dv 



H(t, x — v)p(v) dv 



H(t, z)p(x — z) dz, 



so H satisfies (FBI) on this interval and thus, since i?(p-,-) > 0, we get Hit,-) > 
for t G (§, At time ^ = f + ^ the density is shaved off, leaving 

-F fc (^Fir,^) > F fc+1 (^^-,x). Repeating the above argument we get 



(2. 



F k (t,x) > F k+1 (t,x) for all t G ( 7 m m+1% 



. 2 fc + 1 ' 2 fc + 1 



1) 



Now at time both densities / fc and are shaved off, say at points x^ and x^ 
respectively. Then by (2.8), xj. > x^+i, and thus (2.8) holds at t = pqrr as well. □ 

Since F k {t,x) is decreasing and positive, we can define 

F(t,x) = lim F k {t,x). 

k— >oo 

It is obvious that for each given t, F(t, ■) is non-increasing and its range is [0, 1]. 

Proposition 2.6. For every t G [0, 1] and x G R, F N (t, x) converges almost surely and 
in L 1 as N — > oo to F(t,x). 

Proof. First observe that, for fixed t G [0,1] and x G K and since F N (t,x) < 1, the 
sequence of random variables [F N (t, x)) N>Q is uniformly integrable, so it is enough to 
show that F N (t, x) -> F(t, x) in probability. 

Fix e > 0. Use Proposition 2.2 to choose, for each k > 0, an JVjt > so that 



F^ fc (t,x) -F fe (i,x) 



> 



< 



12 



for every N > and Nk T °°- Define kjq as follows: kjss = 1 f° r N < N\ and fc/v — k 
for iV fc < N < JVfc+i. We have 



F 



AT,fe» 



(t,x) -F(i,x) 



>e < 



e 

>2 



F N ' kN (t,x) -F kN {t,x 

+ p( F kN (t,x) - F(t,x) 



£ 

y 2 



By the definition of fcjv, the first term on the right hand side is less than 1/fcjv, while 
the second one is when kjy is large enoug h. We deduce that F N ' kN (t, x) converges in 
probability as N — > oo to F(t,x). 

To finish the proof write 



(\F N (t,x)- F(t,x)\ >e) <P F N (t,x)-F N ' kN (t,x 



+ 



F 



> 



(t,x)-F(t,x) 



> 



We already know that the second term on the right hand side goes to 0, while the fisrt 
one goes to thanks to Proposition 2.3 (here we use the fact that the convergence is 
uniform in N). □ 



Recall the definition of the shaving points and let X k : [0, 1] 
corresponding linear interpolation, that is, 



be the 



vk vk 

X k (t)=X k m + m+ \ k m 



for f < t < mgk. 



Lemma 2.7. X k {t) converges uniformly in [0,1] to a continuous function j(t). 

Proof. We will start by showing that the sequence of functions (X fc ) fe>Q is relatively 
compact. By the Arzela-Ascoli Theorem, we only need to show that our sequence is 
uniformly bounded and equicontinuous. 

Observe that, for each given k, X k (t) is increasing. Indeed, it is enough to show 
that Xi < X^ +1 for < m < 2 k , and this follows from the fact that, if f k (^, •) > 0, 
then (FBI) implies that f k (t, •) > / fc (f , •) for § < t < Therefore 

sup sup X k (t) =supX k (l). 
fe>ote[o,i] fe>o 

To show that this last sugremum is finite, observe that f(t, x) (which was defined in 
Proposition 2.1) satisfies f(t,x) > f k (t,x) for all k. On the other hand, 



t roo 



f(t,x)dx = 1 + 



/(s, x) dx ds 



whence it is easy to see that 
(2.9) 



fit, x) dx = e*. 



Therefore, if M > is such that J™ f(l,x) dx < 1 we deduce that X k (l) < M for all 
k and the uniform boundedness follows. 
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For the equicontinuity we need to show that given any e > there is a 5 > such 



that 



sup 

fc>o 



X k (t) -X k (s) 



< e 



whenever \t — s\ < 5. Assume that s < t, fix k for a moment and let ^ and ^ be the 
dyadic numbers immediately to the right of s and t, respectively (here we assume k is 
large enough so that mV I < 2 k ). Then 
(2.10) 

X k (t)-X k (s) < X k (t)-X k (%) + X k (%)-X k U) + X k {±)-X k {s 



< 



2 fc ) > \2 k ' ^2 k/ ^2 A 

X k {^)-X k {%) + X k (^)-X k (^) + X k ( : 
fcl 



Now for any p, q & {0, . . . , 2 } with q > p we have 
(2.11) 



f k (%,y)dy 



f k a,y)-f k (%,y) 



dy 



f k (£,y)dy. 



The first term on the right hand side equals 1. The second term corresponds to the 
amount of mass accumulated by f k to the right of X k (-j^) on the time interval (^-, 
Using (FBI) it is not hard to see that this is bounded by the same quantity with f k 
replaced by /, which gives the bound 



f($,y)-f($,y) dy<e^-e^ <e 



q-p 

2 k 



thanks to (2.9). On the other hand, using the fact that X k (t) is increasing, it is clear 
that f k {jk,x) > f k {0,x) = f {x) for x > X k (^r). Therefore the last term on (2.11) is 
greater than or equal to 



fo(y)dy. 



Now X k {^) is non-negative and bounded by M by the preceding arguments, so the 
last integral is at least 



L = inf 

xe[o,M] 



X+E 



fo(y)dy > 



where we used the fact that /o is strictly positive on the positive half-line. Putting the 
last two bounds together with (2.11) we get 



f k (±,y)dy<l + e^-L. 



Now \t — s\ < 5 implies that < 5 + i, and thus we deduce that 



POO 

/ f k (%,y)dy<l 
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for small enough 5 and large enough k and for (p,q) E {(1,1 + 1)) {jn,m + 1), 

The preceding means that if 5 is small enough and K is large enough then \X k (-^) — 

X k (-^)\ < e for k> K and for these three pairs (p,q). Using (2.10) we obtain 



sup 

k>K 



X k (t)-X k (s) 



< e 



if \t — s\ < 5 and 5 is small enough. Since the functions X k are all uniformly continuous 
(on [0,1]), it is clear that, by choosing 5 even smaller if necessary, the same will hold 
also for k = 1, . . . ,K — 1. This finishes the proof of the equicontinuity. 

The last thing we need to show is that our sequence has a unique limit point. 
Consider two convergent subsequences X nk — > 71 and X mk — > 72. Let t = ^ be any 
dyadic number in [0, 1] and assume that k is large enough so that n& A > I. Recall 
from the proof of Lemma 2.5 that X k (t) is non-increasing in k for each fixed t £ [0, 1]. 
Since F nk (t, x) = 1 for all x < X nk (t) we deduce that 

F nk (t,x) = l for all x < 71 (t). 

Now given any k there is a k! such that n/%/ > m&, so by Lemma 2.5 we get 

1 = F nk '{t,x) < F mk (t,x) < 1 for all x < 7l (t). 

This means that X mk (t) > 71 (t) for all large enough k, and taking k — > 00 we deduce 
that 72(i) > 71 (t). By symmetry we get 71 (i) > 72 (t)- This gives 71 (t) = 72 (t) for all 
dyadic t £ [0, 1], and now the uniqueness follows from the continuity of 71 and 72. □ 

Now we make the following observation. Suppose that (g(t, x), j(t)) solves (FB) and 
let G(t, x) = g(t, y) dy. Then it is not difficult to check, repeating the arguments 
leading to (2.7), that (G(t,x),j(t)) must solve the following free boundary problem 
(FB'): 

dG f°° 

(FBI) -(t,x) = J G(t,y)p(x-y)dy Vx> 7 (i) 

(FB2') G{t,x) = l Vx < 7(f) 

with initial condition G(0,x) = J°° fo{y)dy. Moreover, if (G(t, x), 7 (i)) solves (FB') 
and G(t,-) is absolutely continuous for all t, then (g(t, x), j{t)), where g(t,-) is the 
density of G(t, •), must solve (FB). 

Proposition 2.8. F(t, x) is differentiable in t for all x > 7(f) and it satisfies (FB') 

Proof. We already proved that (F(t,x),j(t)) satisfies (FB2'). For x > "y(t) and by the 
definition of F k (t,x) (which implies that F k (t,x) is differentiable inside each dyadic 



15 



subinterval) we may write 

n k (t) 



F k (t, x) = F k (0, x) + [F k {f k ,x) - F k (^,x) 

dF k 



+ 



m=l 

n k {t) 

F k (o,x) + jr 



m=l 



m-l ra\ ds 
ok > 9 k> 



(s, x) ds 



+ 



F k (t,x)-F k (n k (t),x) 
dF k 



( 2jW it) ds 



(s, x) ds 



n k (t) 

+ Y, [F k (^,x)-F k ((^)-x)] + [F k (t,x)-F k (t-,x) 



m=l 

where n k (t) = Recalling that X k (t) j j(t), we can take k large enough so that 

j(t) < X k (t) < x. Since X k (s) is increasing in s we deduce that X k (^) < x for 
to = 1, . . . ,7ifc(£), and therefore all the terms in the last line above are 0. On the 
other hand, observe that F k must solve (FBI) on each dyadic subinterval, which can 
be checked repeating again the calculations in (2.7). Therefore, 



n k (t) 

F k (t,x) = F k (0,x) + Y, 



m=l ( 



m — 1 m \ I „ 
— j — ,— r ) ^ — OO 



F k (s,y)p(x -y)dy ds 



p poo 

+ / / F k (s,y)p(x-y)dyds 

pt poo 

(2.12) =F k (0,x)+ / F(s,y)p(x-y)dyds 

JO J -oo 

F k (s,y) - F(s,y) p(x - y) dy ds 



n k (t) 



+ 



m — 1 m 
2 fc ' 2 fc 
OO 



m=l ( 



(^,t)./-oo 



F k {s,y) - F(s,y) p(x-y)dyds. 



Now for fixed y, F k (-,y) is a decreasing sequence converging to F(-,y), so Dini's Theorem 
implies that 



A k (y)= sup F fc (i,y) - F(t,y) 
te[o,i] 



k^oo 



0. 



The sum of the terms on the last two lines of (2.12) is bounded by 



n k (t) 

£ 

m=l 



m— 1 m 
2 fc '2 k 



A k {y)p{x -y)dy ds + 

POD 

< t / A k (y)p(x - y)dy, 



A k (y)p(x - y)dyds 



and this last integral goes to as k — > oo by the dominated convergence theorem, 
because, using (2.9) we get A k (y) < sup te [ 01 ] F (i, y) < f(l, z) dz < e. Using this 
and taking k — > oo in (2.12) we get 



t fOO 



(2.13) 



F(t,x) = F(0,x) + 



F(s,y)p(x - y)dyds. 



J-oo 
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To finish the proof it is enough to show that the mapping s i— > F(s, y)p(x-y) dy 
is continuous, since if that is the case then we can differentiate (2.13) and deduce (FBI). 
This actually follows easily from (2.13): 



F(s + h,y)p(x - y)dy 



(2.14) 



< 



F(s,y)p(x - y)dy 

oo 

oo [-s+h /"oo 

F(r, z)p(y — z)p(x — y) dz dr dy 

/s+h roo roo 
/ / p(y — z )p( x — y) dz dy dr = h. □ 
J — oo J — oo 



Let v t be the probability measure defined by Ut([x, oo)) = F(t,x). Since F satisfies 
(FB') we have that for every b > a > j(t), 



d_ 
~dt 



vt{[a,b]) 



u t ([a-y,b-y])p(y)dy 



and thus by standard measure theory arguments we deduce that for every bounded and 
measurable (p with support contained in (7(i), oo), 



(2.15) 



d_ 
~dt 



<p(y)vt(dy) 



ip{x + y)p(y)v t (dx) dy 



(cf. (2.2)). Now if A C (7(t),oo) has zero Lebesgue measure and the support of cp is 
contained in A, then the right-hand side above is and we deduce that ft (^4) is constant. 
Since vo(A) = f A fo(x)dx = 0, we have proved that v t is absolutely continuous with 
respect to the Lebesgue measure. We will denote its density by f(t, •), and we obviously 
have F(t, x) = f(t, y) dy. 

At this point we are ready to finish the proof of Theorem 1 by showing that F 
satisfies the desired properties and then using the convergence of the tail distributions 
F N (t,x) to obtain the convergence in distribution of the process in D([0, 1],V). 

Proof of Theorem 1. We already know that F N (t, x) — > F(t, x) almost surely as N — > oo 
and that F can be written in terms of the integral of /. Now, by (2.13), 



5 roo ft roo roo 

f(t,y)dy= / Hy)dy+ / / / 

Jx Jo J —ooJ y 

for x > 7(t), so for any h > we have 
1 " 



f(s,z)p(x -y)dzdyds 



(2.16) 



f(t,y)dy 



x+h 



f{t,y)dy 

x+h 



t roo 



fo(y)dy 



x+h—u 



f(s, z) dz p(u) du ds. 



The first integral on the right-hand side goes to fo{x) as h — > (recall that fo(x) is 
continuous for x > 0). The second term goes to /q*/^, f(s, x — u)p(u)duds by the 
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dominated convergence theorem. On the other hand, the left-hand side of (2.16) goes 
9F 

dx 



to 7&(t,x), which equals —f(t,x) for almost every x > j(t). We deduce that 



ft fOO 

(2.17) f(t,x) = f (x)+ / / f(s,y)p(x-y)dyds 

J0 J-oo 

for almost every x. 

Now if x n — ► x, then by (2.17) 

rt poo 

\f(t,x n ) - f(t,x)\ < \fo(x n ) - f (x)\ + / / f(s,y)\p(x n - y) - p(x - y)\ dyds. 

JO J -oo 

The first term on the right side goes to as n — ► oo because /o is continuous, while 
the second term goes to by the continuity and boundedness of p and the dominated 
convergence theorem. We deduce that f(t,x) is continuous in x and thus, in particular, 
(2.17) holds for every x. Thus (f(t,x), , y(t)) satisfies (FB). Moreover, since the above 
convergence can be achieved uniformly for t in compact intervals, it is easy to see that, 
if f(t,x) is continuous in t for x ^ *y(t) (as we will show next), then it is actually 
jointly continuous in t and x outside the curve {(t, 7(i)) : t > 0}. The fact that f(t,x) 
is differentiable (and thus continuous) in t for x ^ ^(t) follows easily from (2.17) by 
repeating the arguments in (2.14). f(t,x) > for x > ^(t) thanks to the facts that 
fo(x) > for x > and that 7 is non-decreasing because, according to the evolution 
defined by (FB), f(t,x) only decreases when x = 7(i). 

The only thing left to show before turning to the proof of the convergence in distribu- 
tion of vf in D([0, l],V) is that (f(t, x),~/(t)) is the unique solution of (FB). To do this, it 
is enough to show that if (h(t, x),a(t)) is any given solution and H(t, x) = f°° h(t, y) dy 
then H(t, x) = F(t, x). Indeed, if that is the case then the above arguments imply that 
h(t,x) is jointly continuous in t and x outside the curve {(t, 7(t)) : t > 0}, and thus 
h(t,x) = f(t,x), while (FB2) and (FB3) imply that a(t) = <y{t). 

The idea of the proof will be to compare h(t, x) and f k (t, x) by adapting the coupling 
introduced in the proof of Lemma 2.4 to the deterministic system, so we will sketch the 
main ideas and leave the details to the reader. We will write 

f k (t,x)=g k (t,x)+d k (t,x)+b k (t,x), 

which is to be interpreted in a manner analogous to (2.4). We construct these three 
functions inductively on each dyadic interval. We start with g k (0,x) = fo(x) and 
d k (0,x) = b k {0,x) = for all x. Next, for t G (0, i) we let the three functions evolve 
according to the following system: 



g (t,y)p(x - y)dy for x > a(t) 



00 



g k (t,x)=0 for x < a(t) 
(2.18) dd k 



dd f°° 

— {t,x) = l x < a (t) J g k (t,y)p(x - y)dy 

db k f' 00 r 1 

— (t,x)= / \d k (t,y) + b k (t,y)\ p(x-y)dy 
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Then it is clear that f k (t,x) satisfies (2.3) on this interval as required. At time ^ we 
need to shave off the extra mass in f k . Observe that g k ((-^)—,x) = h((^jr) — ,x) for all 
x, so X k > a(^). Thus all the mass to the left of cr(p-) needs to be erased, so we put 
d k (jfr, x) = for all x. The rest of the mass to be erased from f k will come from both 
g k and b k . This leaves us with 

g k (^,x) <h(^,x), d k (^,x)=0, b k (j,,x)>0. 



We continue the construction inductively. Assume that the above holds at time 



in 



for some 1 < m < 2 k . On the interval (p-, ^j-) we let g k , d k and b k evolve according 
to (2.18). At time we need to shave off the extra mass in f k , and as in the proof 
of Lemma 2.4 it is not hard to see that after doing that we may rebalance d k and b k in 
such a way that d k (~jr,x) = for all x and the total mass of b k ( n ^-, x) is at most the 
total mass of b k {{^^-) — , x). Observe that this construction preserves the inequality 
g k (t,x) < h(t,x) thanks to the observation following the proof of Proposition 2.1. 

Now repeating the arguments leading to (2.5) we deduce that the total bad mass 

?<c 



bt = f^co b k (t, x) dx satisfies bf < (1 + b^w)^ z k — (t — |£ + 1), and thus 



-k . e-1 _ 2fc 



e- 

as in (2.6). Since the sequence (J>m/2 k ) m= o 2 k * s increasing we deduce that 

— k 

SU P Kn/2 k ~ ► 0. 

m=0,...,2 fc fe ~ > °° 

Now let t = ^ be any dyadic number in [0, 1] and assume that k > I. Using the fact 
that d k (t, x) = for all x E R we can write 

(2.19) = (h(t, y) - f k (t, y)) dy = (h(t, y) - g k (t, y)) dy - j™b k (t, y) dy 

so (h(t,y) — y fc (t,y)) dy < b\ — > 0. Thus, since h(t,y) > g k (t,y), we deduce that 
f°° y fc (i, y) <iy — ► /i(£, y) dy as A; — > oo uniformly in x, which in turn implies by (2.19) 
that 

roc roo 

lim F k (t,x)= lim / f k (t,y)dy= h{t,y) dy = H(t,x) 

uniformly in x and for every dyadic number t G [0, 1]. Since (i? (i, x), cr(t)) solves (FB'), 
H is continuous in t and we deduce that H(t,x) = F(t,x) for all t G [0, 1]. 

To show that the sequence of processes converges in distribution in D([0, 1] , V) to 
the deterministic process vt in this space defined by having its densities evolve according 
to (FB), it is enough to prove that this sequence is tight. In fact, if v t k is any convergent 
subsequence, then its limit Vt is completely defined by its tail distribution at each time t, 
which we know must be F(t, •). To show that vf is tight it is enough, by Theorem 2.1 of 
Roelly-Coppoletta (1986), to show that for any continuous and bounded function tp on R 
the sequence of real-valued processes (v^, <p) = <p(%) (dx) is tight in D([0, T], R). 
Fix one such function ip. By Aldous' criterion (which we take from Theorem 2.2.2 in 
Joffe and Metivier (1986) and the corollary that preceeds it in page 34), we need to 
prove that the following two conditions hold: 
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(i) For every rational t E [0, T] and every e > 0, there is an L > such that 



sup P(| (1/^)1 >L) <e. 

7V>0 

(ii) If Xjf is the collection of stopping times with respect to the natural filtration 
associated to (z/^ , 99) that are almost surely bounded by T, then for every e > 

limlimsup sup P( Wu. +s)AT ,<p) - {v? , <p)) > e) = 0. 

The first condition holds trivially in our case by taking L > ||y||oo- To get the second 
one fix JV > 0, £ > 0, < s < r and r E and let K be the number of branchings in 
7]^ on the interval [r, (t + s) AT]. Observing that 



AT 



and E(if) < A^s < Nr, we deduce by Markov's inequality that 

'2\\tp 



P(|(^)/o'.V>-<^V»|>e)<p(- 
and (ii) follows. 



TV 



1 - — iiv — < ^T~ 



□ 



3 Proof of the results for the finite system 

Now we turn to the properties of the finite system. Let us first give an explicit con- 
struction of our process. Consider an i.i.d. family (Uf*)i>i with uniform distribution on 
{1, . . . , N} and an i.i.d. family (Ri)i>i with distribution p and let (?i)j>i be the jump 
times of a Poisson process with rate 1. To construct r)^ we proceed as follows. Assume 
that r$(l) > ■ ■ ■ > rjff(N). At time Ti, branch r]^(U^) using Ri for the displacement, 
erase the leftmost particle, and then relabel the particles to keep the ordering. Continue 
this inductively, using the variables Uf and Ri to determine the branching at time Tj. 

This construction allows us to give a monotone coupling for two copies of the process. 
As in the proof of Proposition 2.2, for /x, v E A4 we will say that fi ^ v whenever 
fj*([x, 00)) < u([x, 00)) for all x E R. Observe that if fi = Yli=i ^ w ith x\ > • • • > X7V1 
and v = Yli=i ^Vi with y\ > • ■ ■ > yjy 2 then \i < v \i and only if A 7 ! < A2 and x.j < y.j 
for i = 1, . . . , N±. It is easy to check that if rff 1 and are two copies of our process 
(note that we allow them to have different total number of particles) with t]q X ■< 
(i.e., in the sense that Yls^i $ , a ~< 5,iv 9 ,.A then if we use the same branching 

\ ' »j 1 (i) — *-~<t— 1 i Q (1)' 

times and displacements and the same uniform variables for the particles in rj^ 1 and 
the leftmost Ni particles in rff 2 , then we have r/f 1 * 2 for all t > 0. 

For most of the proof of Theorem 2 it will more convenient to work with the discrete 
time version of our process rj^ which is defined as follows: at each time step, choose one 
particle uniformly at random, and then branch that particle and remove the leftmost 
particle among the N + l. The variables Uf and Ri can be used to decide which particle 
to branch and where to send its offspring at each time step. 
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Proof of Theorem 2(a). We will first prove that each of the two limits exists with prob- 
ability 1 and in L 1 and that the limits are non-random. We will do this for the discrete 
time process and leave to the reader the (easy) extension to the continuous time case. 
We borrow the proof from that of Proposition 1.1 in Berard and Gouere (2008). Since 
it is simple we include it for convenience. Observe that it is enough to prove the result 
for an arbitrary deterministic initial condition 770 £ Xtv, which we fix below. 

Consider the variables , Ri and Tj introduced in the coupling above. For each 
k > let {i]kn)n>o k e a co Py °f our P rocess ; started at 770, constructed as follows: if 
is given then we let "% n+1 be specified by adding a particle at Vkn(^n+k) + Rn+k and 
then removing the leftmost particle. It is clear then that for any k, I > 0, max 77^ k+i < 
max 77^ + max 77^,. Moreover, for any given d > 1 the family {Vdnd) n >i * s i-i-d-> 
while the distribution of (Vkn) n >o does not depend on k. Now for < k < n define 

f£n = ^Kn-k- Then max ^fc+i < max ^fc + max Cfe+«' the family ( maX ^,d(n.+l))n>l 

is i.i.d for any d > 1 and the distribution of (max ^kn+h) n >0 does not depend on k. It 
is not hard to check that max ^ n satisfies the rest of the hypotheses of the subadditive 
ergodic theorem (see Theorem 6.6.1 in Durrett (2004)) and thus lim n _ +00 max^/n 
exists almost surely and in L 1 , and moreover the limit is non-random. Since {Co n ) n >o 
has the same distribution as ( r ?^ r ) n>0 > the same holds for linin^oo max 77^ jn. 

The above proof can be straightforwardly adapted to obtain the existence of the 
limit for min?7j jt. To show that the two limits are equal it is enough to prove that 
(max 77^ — mmn^/t — ► in probability as t — > 00. Observe that if we follow the 
genealogy of the particle at max 77^ back in time and go back in time N generations 
then we will necessarily reach a particle that is not in 77^ (at time t) , and that is thus to 
the left of mm rj^ . If we call Xt the position of this particle and let Nt be the number 
of branchings in the system up to time t, then clearly max 77^ — X t < X^JjVt-iV l-^l- 
Thus for any e > we have that 



maxT?, -mm 77, > \ p ^ > ^ for ^ Nt _ N <i< N ^ +1 



N^t >£ 



N 



Ri\<Vi) +ljv_ >0. 



v q >£ t->oo 

The monotone coupling introduced above allows to deduce that a at is non-decreasing. 
On the other hand, in the case N = 1 we have that 77^(1) is simply a random walk 
jumping at rate 1 whose jump distribution is that of -Ri V 0. Therefore E(?7^(l)) = bt 
with b = J °° xp(x) dx > 0, and thus ajy > a\ = b > for all N > 1. □ 

To prove parts (b) and (c) of Theorem 2 we will work with the discrete time version 
of the shifted process: A^ = (A^(l), . . . , A^(iV)) with 

Recall that A^ is a Markov chain and its transitions are the same as those of r/^ except 
that after erasing the leftmost particle the N remaining particles are shifted to the left 
so that the new leftmost one is at the origin. 

Proposition 3.1. A^ is a positive recurrent Harris chain. 
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Proof. Following Athreya and Ney (1978), in order to show that A^ is Harris recurrent 
we need to show that there is a set A C Xn such that 

(i) P«(r A < oo) = 1 for all f G X N , where t a = inf{?i > 0: A^ G A}. 

(ii) There exists a probability measure q on A, a A > and a k G N so that P^(Af G 
fl) > Ag(B) for all £ G ^ and all BCA 

To achieve this, choose some L > so that 5 = p((0, L)) > and let 

A = {£ G X N : - £(i + 1) G (0, L) for i = 1, . . . , N - 1 and f (JV) = 0} . 

Then for any initial condition A^ G Xtv we can get to A in N — 1 steps via the 
following path: at time 1 we choose to branch the rightmost particle (the one at Aq^I), 
which happens with probability iV^ 1 ) and send the newborn particle to a location 
x\ G (A^(l), Aq^I) + L) (which happens with probability at least 5). Next we branch 
the particle at x\ and send the newborn particle to a location x<i G {x\,x\ + L) (which 
happens with probability at least S/N). If we continue this for N — 1 steps we will end 
up with a configuration in A, and thus 

/ r \ N- 1 

(3.1) pf(ASf_ ie A)> (± 

The bound is independent of the initial condition £, so by the Borel-Cantelli Lemma it 
follows that (i) holds. Moreover, if B C A is of the form B = {£ G Xn '■ — + 1) G 
Bi G (0, L) for i = 1, . . . , N — 1 and £(N) = 0}, then the preceding argument implies 
that 

F^(Ai ieB )>*^, 

so by taking A = N~ N+1 and q to be the normalized Lebesgue measure on the first 
N — 1 coordinates of the configurations in A, we deduce that (ii) also holds. 

To check that A^ is positive recurrent it is enough to check that sup^^ E^(ta) < 
oo. This follows from (3.1) and the strong Markov property by writing 

(i+l)N-l 

E ^ t a) = Yl F ^ TA >n) <N -1 + ^2 Yl F ^ TA - n ) 

n>l i>l n=iN 



<N-1 + (N-1)Y^ 



1 



N-l 



N ,N-1 



(JV-1)( T ) <oo 



t>i 

for any £ G %n- □ 

Proof of Theorem 2(b). The result now follows from Proposition 3.1. The fact that A^ 
is positive recurrent implies that the invariant measure whose existence is assured by 
the Harris recurrence is finite. The absolute continuity of /ijv is a direct consequence of 
Theorem 2(c), which we prove below, together with the fact that if the initial condition 
for A^ is absolutely continuous, then so is the distribution of the process at all times. □ 

Proof of Theorem 2(c). Let A C Xjv and k = N — 1 be the objects which we found 
satisfy (i) and (ii) in the proof of Proposition 3.1. It is enough to prove the result 
along the k subsequences of the form (^km+j) m >o w ^ n < 3 < k. Moreover, using 



22 



the Markov property at time j we see that it is enough to prove the result along the 
subsequence (A£^J m>0 , which is an aperiodic recurrent Harris chain. The result for 
this subsequence follows from Theorem 4.1(h) in Athreya and Ney (1978) as long as we 
have that sup^P^T/i > t) < 1 for some t > 0, where ta is the hitting time of A. This 
follows easily from the estimate in (3.1) (which is uniform in £). □ 



4 Proof of Theorem 4 



Recall that in this part we are assuming that p (and hence its tail distribution R) 
has exponential decay and, consequently, that the moment generating function of p, 
4>(6) = f^°oc e9x P( x ') is finite for 6 G (—0,9) (see (1.2) and (1.7)). Before getting 
started with the proof of Theorem 4 we need to prove the claim we implicitly made in 
(1.6). 

Lemma 4.1. 

• 0(A) 
mm — - — = a, 

Ae(o,e) A 

where a is the asymptotic speed defined in Theorem 3. Moreover, letting A* G (0, 0) be 
the number such that (f>(X*)/X* = a, we have that 4>'(X*) = a, (p(X)/X is strictly convex 
on (0,0) and the sign of (f)'(X) — (f>(X)/X equals that of X — X* . 

Proof. Define c(A) = (f)(X)/X for A G (0,A*). A little calculus shows that c(A) is strictly 
convex: 

/w = ffl_W + 2#| = i£ [(Jl _ 1)!+1)] e ^ (x) dx > 

It is clear that c(A) — > oo as A — > 0. On the other hand, (1.3) implies c(A) — > oo as 
A — ► 0— as well. Thus the minimum of c is attained at some A* S (0, 0), and we have 
c'(A*) = 0, or (j)'(X*) = 4>(X*)/X*, which will give the second claim in the lemma once 
we show that c(A*) = a. 

Now recalling the definition of a we need to show that 

sup [0c(A*) - (f)(6)] =0. 

6»>0 

This is easy: using the definition of c we get 

sup [0c(A*) - (f)(9)] > X*c(X*) - <f>(X*) = 0, 

while for all 6 > 

6c(X*) - (f)(6) < 6c(6) - (f)(6) = 0. 
Finally, to get the last claim in the lemma recall that c'(A*) = and c is convex, so 

m - ^ = Ac'(A) 

is negative for A G (0, A*) and positive for A G (A*, 0). □ 
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As we explained in Section 1.3 the proof of Theorem 4 will depend on looking for 
positive solutions to the equation (1.8). We will actually consider a slightly more general 
equation: 



(4.1) U(x)= U(y)k(x-y)dy Vx > 0, 

Jo 

where we look for a non-decreasing solution U, continuous except at the origin, with 
U (x) = for all x < 0. In Spitzer's terminology, a solution U with the above properties 
is a P*-solution of (4.1). When \\m x — >oa U{x) = 1 we call U a P-solution, and think 
of it as the distribution function of a non-negative random variable. Here are the two 
results of Spitzer that we will need. 

Theorem 4.2 (Theorems 2 and 4 in Spitzer (1957)). 

(a) If j^° oQ xk(x)dx < then there is a unique (up to multiplicative constant) P*- 
solution of (1.5). 

(b) If xk(x) dx < then there is a unique P-solution of (1.5) which can be ob- 
tained as the limit U{x) = lrm n _ >00 U n (x) of the iterative procedure defined by 
U n+ \{x) = J °° U n (y)k(x — y) dy starting with an arbitrary continuous Uq corre- 
sponding to the distribution function of a non-negative random variable. 

(c) If J^ D oo xk(x) dx > then (1.5) has no P-solution. 

Repeating the arguments we used to show that F(t, x) had a density (see (2.15)) we 
see that if U is a P*-solution of (4.1) then there is a non-negative function u such that 
U{x) = u(y) dy. Again repeating previous arguments (see the first part of the proof 
of Theorem 1), we deduce that u satisfies (1.8), while obviously u(x) = for x < 0. 
u is continuous except possibly at the origin by the dominated convergence theorem 
thanks to the fact that k is continuous. Multiplying u{x) by e Xx will allow us to obtain 
a solution for (1.5) with the desired properties. 

Proof of Theorem 4- Let c > a and take A E (0, A*] such that <j)(X)/X = c as above. The 
uniqueness of the solutions of (1.5) in this case follows from Theorem 4.2(a). In fact, if 
w\ and W2 are two solutions of (1.5) then U{(x) = e Xx W{(x) solves (1.8) for i = 1,2, and 
thus the functions Ui(x) = Jq Ui(y) dy are P*-solutions of (4.1), and they are continuous 
because the U{ are locally integrable. Hence U\{x) = AU^x) for all x G M and some 
A > 0, which implies that u>i(:e) = Aw2(x) for all x G KL Integrating this relation we 
get A = 1 and uniqueness follows. 

To show existence we start by integrating by parts to obtain 

/oo \ poo /'OC / i \ 

xk(x) dx = —— / xe Xx R(x) dx = —— \ i-- e Xx p(x) dx. 
-oo <A(A) J-oo <t>WJ~oo\ V 

Thus the sign of J^ D oQ xk(x) dx is the same as that of <^>'(A) — ^(A)/A. This last quantity 
is strictly negative for A G (0, A*) and vanishes for A = A* by Lemma 4.1. 

If c > a, then A < A* and thus Theorem 4.2(b) provides us with a P-solution of 
(4.1) to which, by the discussion preceding this proof, we can associate a continuous 
(except at the origin) function u satisfying (1.8) and corresponding to the density of 
a non-negative random variable. Now let A = J °° e~ Xx u(x) dx which is clearly finite 
(actually A < 1). Define w(x) = A~ l e~ Xx u(x). Then w is the density of a non-negative 
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random variable and it is easy to check that it satisfies (1.5a) with c = (p(X)/X. w 
obviously satisfies (1.5b), (1.5c) and (1.5d) as well, and it is continuous except possibly 
at the origin because so is u and, by definition, f °° e Xx w(x) dx = A~ l < oo. The last 
thing left to show in this case is that w is differentiable (except at the origin), but this 
follows easily from writing, for x > 0, 



oo poo 



^ roo poo -y 

w(x) = - w(y) p(z) dz dy = - I I w{y)p(z - y) dy dz, 
c JO Jx-y c Jx JO 

and using the fact that the integrand above is continuous. 

To prove the second claim in (b) we may obviously assume that A' E (A, A*). Suppose 
that w solves (1.5) and let W(x) = w(y) dy. It is not hard to check that W satisfies 

W{x) = - W(y)R(x-y)dy Vi > 0. 

J —oo 

Then if A — sup x >Q e x VF(x) < co we have that 

. ~ 1 poo _ _ A poo _ 

W(x)e Xx = - W(y)e Xy e x{x - y) R(x - y) dy < - e x{x ~ y) R(x - y) dy 

C J — oo C J ~oo 

C J —oo C A 

Taking supremum in x > and recalling that c = <f>(\)/\ the above says that 

A' 0(A) A ' 

and since A > this says that 4>(\)/X > 4>(X)/X. But Lemma 4.1 implies exactly the 
opposite. This is a contradiction, and thus A = oo. 

The case c = a is similar so we will skip some details. Now we have A = A* and 
thus xk(x)dx = 0. Theorem 4.2(a) provides us now with a P*-solution of (1.5) to 
which corresponds a function u which is the density of a measure supported on [0, oo) 
and which satisfies (1.8). Let A = e~ x x u{x) dx. We need this quantity to be finite, 
so that w(x) = A~ l e~ x x u(x) be a (continuous) probability density. This follows from 
Theorem 6.2 of Engibaryan (1996), which assures that U(x) < Cx for some C > 0, and 
integration by parts: 

OO POO 

e~ x ' x u{x) dx = lim e~ x * x U(x) - 17(0) + A* / e~ x * x U(x) dx 
o x ^°° Jo 

/•oo 

< CX* / e~ x * x xdx < oo. 



It is easy again to verify that w satisfies (1.5), and its differentiability follows from the 
same reasons as above. Clearly J* e x * y w(y) dy = A~ l J^u{y)dy = O(x), so w is a 
solution with the desired properties. 

We are only left showing (d), that is, that there are no solutions of (1.5) when c < a. 
We start by observing that if w were a solution then the above arguments would imply 
that w is differentiable on the positive axis, and thus if we set fo{x) = w(x) in (FB) we 
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get j(t) = ct. Therefore, to show the non-existence of solutions for c < a it is enough 
to show that, given any e > and any /o supported on [0, oo), there is a T > such 
that the solution (f(t, x), "f(t)) of (FB) satisfies 

(4.2) 7 (T)>(a-e)T. 

To achieve this consider the following construction. Take M > 1 and let xo be 
such that fo(x) dx = 1/M. Now recall the definition of the process Vt hi the proof 
of Proposition 2.1, which corresponded to the deterministic measure valued limit of 
Vf , and observe that we can run this process started with any initial measure (not 
necessarily an absolutely continuous one). Moreover, (2.2) still holds in this case by 
Theorem 5.3 of Fournier and Meleard (2004). Letting Vq = M~ 1 5 xo (a point mass at xo 
of size 1/M), run the process % up to time t = log(M) and define xi to be such that 

^log(Af)([Xl,°o)) = V M - 

We need to prove two things, which together will imply (4.2). The first one is 
that 7(log(Af)) > xi- To see that this is indeed the case, consider a copy of the 
process u t started at vq (which is the probability measure corresponding to /o) and a 
copy of V t started at M~ l 5 Xo . It is clear that Vq H i/q, in the sense of the ordering 
introduced in Section 3, and since the mass of V t is bounded by 1 for t € [0, log(M)], the 
evolution of these processes preserves this inequality up to time log(M). This implies 
that xi < 7Q°e{M)). The above arguments can be made precise using some of the 
ideas introduced in the proof of Theorem 1, but we leave the details to the reader. 

Next we need to prove that 

(4.3) Xl >(o-e)log(M) 

as long as M is large enough. This together with the bound of the preceding paragraph 
imply (4.2) by taking T = log(Af). To show (4.3) we may assume, by the translation 
invariance of Vt, that xo = 0. Observe that Vt, started with Vq = 5q, corresponds to the 
mean measure of the branching random walk £j (started with just one particle located 
at the origin). This can be made precise by roughly writing down the formula for the 
generator of and applying it to functions of the form £ ^ (£></?) — Ei=i ^(^W)) 
where iV(£) is the number of particles in the branching random walk configuration £, to 
deduce that after taking expectations the resulting equation is the same as (2.2). We 
leave the details to the reader, and instead only state that the above implies that 

(4.4) V t ([ct,oo))=E(£([ct,oo))) 

for every c, where £j Qci, oo)) denotes the number of particles in the branching random 
walk to the right of ct at time t. On the other hand it is well known that 

E(£([ct,oo)))=e t P(S t >ct) 

where St is defined as in (1.4), and thus (1.4) implies that 

hm ilog(E(e t 1 ([ct,oo)))) =A(c) + l. 

Since A is strictly decreasing on [0, oo) and A(a) = —1 we deduce that 

a = supjc > 0: lim - log(E(^([ct, oo)))) > ol . 
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This together with (4.4) implies that 

(4.5) u t {[(a-e)t,oo))>l 

for large enough t. On the other hand, if we start % at M~ 1 5$, then xi > i a ~ e ) log(Af) 
if and only if ^i og (M)([( a — e ) log(M), oo)) > 1/M. This is equivalent to starting D t at 
<5o and asking for t'i og (M)([( a ~~ £ ) l°g(-^)) 00 )) > 1) which holds for large enough M by 
(4.5). We have obtained (4.3) and thus the proof is complete. □ 
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